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foreword 


Th.  «>rk  docu«nt.d  in  thi.  report  .an  p.rfor»d  by  Dr. 

».  D.  Montgo«ry  and  Dr.  P.  ».  Brooi«  of  the  Electronica 
Renearch  Laboratory  of  Convair/intronautlca.  It  ban  been 
included  an  a  part  of  the  BAMBI  atudy  for  reasons  of  adnini- 
stratiee  expediency  in  order  to  eliminate  the  necessity  of  the 
monitoring  of  two  contracts  although  it  .as  negotiated  in¬ 
dependently  ,1th  different  motivations  and  objectives. 

This  work  .as  intended  as  a  generalized  study  of  the  space¬ 
time  filtering  concept  rtiich  is  applicable  to  electro-optical 
detection  systems.  Thus,  although  it  may  he  applicable  to 
the  BIMBI  study,  the  technical  approach  and  the  results  .ere 

act  constrained  by  it  and  should  not  he  Interpreted  .ithin 
its  context. 
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ABSTRACT 


The  pp»a«nt^ study  is  oriented  toward  enhancing  the  detection 
of  a  missile  launch  as  seen  from  a  satellite.  Some  prelimi¬ 
nary  considerations  of  satellite  altitude,  wave-lengths  viewed, 
and  optical  system  used,  show  up  the  more  important  character¬ 
istics  of  the  problem  on  the  image  plane  of  the  optical  system. 
Considering  the  intensity  distribution  on  this  plane  as  the 
input,  three  important  spatial  filters  are  developed,  having 
two-dimensional  outputs.. 


The  first  filter  is  a  linear  one  whose  simulation 

y  i'  :  ■ 

eartsrvcbnrat  on  an  IBM  7000  The  siiwilwtien  uses, a  localized 
target  against  a  random  and  non_random  background.  The  inputs 
and  outputs  are  shown, as  taken  -from  a  CharactBron  -tube-.  The 
second  filter  iB  a  general  statistical  one  wli^«h  has  consider- 
able  discrimination  ability  but  needs  a  large  amount  of 

statistical  information  of  the  background.  The  third  filter^ 
is  developed  along  the  lines  of  decision  theory  <aai(  turns 
out  to  be  remarkably  similar  to  the  r ** i ~~i  iriiir  1  y  mr  nti  imid 
general  one. 


For  each  filter  the  application  to  multiple  wave  lengths  as 
3®  to  point  source  and  non  point  source  targets  is  con¬ 
sidered.  The  use  of  several  filters  combined  in  a  serial  or 
parallel  fashion  is  mentioned. 
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1.  INTRODUCTION 


The  inaediate  problem  that  the  present  study  is  concerned  with 
is  the  detection  of  a  missile  launch  as  seen  from  a  satellite. 
The  satellite  detection  system  will  utilize  data  presented  to 
it  on  an  image  plane  and  will  absorb  the  data  by  way  of  an 
array  of  photo  cells,  a  scanning  photo  cell  or  cells,  a  tele- 
vision-type  scan,  or  some  such  similar  technique. 

The  main  emphasis'  of  the  study  is  on  possible  filtering  ac¬ 
tions  that  can  be  performed  on  the  image  plane  intensity  as 
an  input.  Such  filtering  will  henceforth  be  carried  out  by 
what  is  termed  a  "spatial  filter"  whose  design  is  based  on 
certain  a  priori  knowledge  of  the  target  and  background. 

The  purpose  of  the  filter  is  to  enhance  the  detection  of  the 
target.  Filters  which  are  discussed  are  of  two  types,  the 
first  type  does  processing  in  which  the  output  is  obtained 
by  performing  mathematical  operations  on  the  input.  The 
second  is  a  decision  filter  which  votes  according  to  a  given 
criterion  as  to  the  absence  or  presence  of  a  target  in  a  given 
neighborhood  of  the  image  plane,  thereby  partitioning  the  out¬ 
put  into  two  levels,  absence  or  presence.  Both  types  involve 
a  mapping  of  the  input  plane  onto  the  output  plane. 
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The  data  which  is  seen  at  the  iuge  plane  haa  already  had 
Bome  processing  which  wust  be  considered  in  the  analysis.  This 
is  dependent  upon  the  wave  lengths  viewed,  the  optical  iwag- 
ing  systew  employed,  and  the  satellite  altitudes  considered. 
These  are  all  external  variables  which  deterwine  sowe  of  the 
general  properties  of  the  incoherent  light  intensity  distri¬ 
bution  on  the  iwage  plane  of  the  optical  systew.  The  effect 
of  these  variables  is  to  place  constraints  on  the  intensity 
distribution  in  the  iwage  plane. 

At  launch  the  diaensions  of  the  pluwe  of  an  Atlas  aissile 
about  Mtch  those  of  the  missile  itself  (80  feet  by  10  feet). 
Thus  from  a  satellite  the  missile  at  launch  would  most  likely 
appear  as  a  point  source  in  the  infrared^  Out  of  the  atmos- 
phere  the  plume  can  be  expected  to  increase  its  dimensions 
by  at  least  ten  times,  and  this  combined  with  its  possible 
decreased  distance  makes  it  no  longer  a  point  source.  This 
addition  of  structure  to  the  target  as  seen  in  the  optical 
image  is  of  importance  as  more ^complex  filter  criteria  may 
be  used  to  advantage.  In  general,  structure  is  always  pres¬ 
ent  when  the  tine  dimension  is  included,  even  for  a  spatial 
point  source  if  its  velocity  is  not  zero. 
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Thia  point  may  be  illustrated  as  follows:  In  Figure  1  the 
usual  Raleigh  criterion  for  resolution  gives  the  resolvable 
dimension,  r,  as  a  function  of  the  distance,  D,  for  the  two 
wave  lengths- =  3,5  microns  (infrared)  and  X  =  .5 
microns  (visible).  The  objective  aperture  is  assumed  to  be 
about  one  meter. 

Since  the  plume  radiation  is  most  intense  in  the  infrared 
(2.78^  and  4, 2/6/  ),  this  and  a  typical  visible  wave  length 
have  been  chosen  as  being  representative. 

The  resolution  distance  available  on  the  image  plane  is  de¬ 
termined  by  the  wave  length  used  as  well  as  the  aperture  and 
focal  length  of  the  optical  system.  For  an  aperture  of  one 
meter  and  a  focal  length  of  four  meters,  the  two  wave  lengths 
give  resolution  distances  of  ,02  mm  for  3,5^  and  ,002  mm 
for  ,5^  ,  Even  though  a  scanning  (say  100  lines  per  mm) 
of  the  image  plane  may  not  be  able  to  take  full  advantage  of 
this  resolution  the  two  cases  previously  mentioned  for  the 
target  will  still  persist,  namely  the  point  source  and  the 


finite  size. 


I.l  INFORMATION  CONSIDERATIONS 

Th«  input  to  the  spatial  filters  is  a  two-disenslonal  intensity 
distribution  on  the  inage  plane  of  an  optical  systenio  Sone  r*- 
■arks  are  in  order  concerning  the  output  of  such  a  filter.  In 
particular,  if  the  filter  is  considered  as  an  information  chan¬ 
nel  it  would  be  desirable  to  keep  the  information  loss^^^ 

( eQuivocation )  down  unless  considerable  data  reduction  is  an¬ 
ticipated  within  the  filter  itself.  If  the  dimepsion  of  the* 
output  were  lower,  say  one-dimensional  with  intensity  a  func¬ 
tion  of  time,  then  it  can  be  shown  that  there  would  be  either 
considerable  information  loss  or  loss  of  continuity  between 
input  and  output.  The  best  that  such  a  filter  could  do  to 
minimize  information  loss  alone  would  be  to  establish  a  one- 
to-one  correspondence  between  the  input  plane  and  the  output 
line,  which  can  be  done. 

A  topological  theorem  tells  us  that  in  such  a  case  the  con¬ 
dition  of  bicontinuity  cannot  hold  simultaneously.  This 
implies  for  the  present  circumstance  that  a  localized  target 
on  the  plane  could  not  produce  a  localized  output  on  the  line 
and  this  would  be  a  serious  loss  for  detection  systems.  This 
one-to-one  correspondence  would  involve  no  information  loss 
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fn  a  technical  sensea^^^  The  on^y  way  that  aoae  dagree  of 
continuity  could  be  aalntained  would  be  1^,0  wake  the  napping  a 
■any-to-one  type  (eag.,  aee  Figure  2)o  This  neans  that  a 
localized  target  on  the  plane  would  produce  a  nore  localized 
output  on  the  line  but  aany  input  points  would  be  napped  onto 
the  same  output  point <>  This  would  inply  that  many  distinct 
inputs  would  produce  the  same  output  giving  rise  to  an  in¬ 
formation  loss..  From  the  above  considerations  it  would 
appear  desirable  to  have  a  two-dimensional  output  for  the 
spatial  filterso  Most  of  the  filters  considered  will  be 
of  this  typeo 

This  is  not  meant  to  imply  that  the  data  must  be  processed 
in  a  parallel  fashion..  Serial  processing  with  retention  of 
dimension  does  not  constitute  an  information  loss  process. 
lo2  CONTINUOUS  AND  DISCRETE  DESCRIPTIONS 
The  intensity  distribution  on  the  image  plane  is  of  a  con¬ 
tinuous  nature  but  due  to  the  resolution  limitation  it  is 
actually  discrete  and  in  fact  finite.  One  way  this  can  been 
seen  is  by  a  consideration  of  the  sinusoidal  intensity  waves 
that  add  up  to  form  a  two-dimensional  picture This  is 
its  Fourier  description  and  usually  involves  the  summation  of 
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Figure  2 

Localization  in  a  Many-one  Map 


an  Infinite  number  of  the  sinusoidal  waves.  Since  in  the 

present  situation  there  is  a  finite  resolution  distance,^  , 

on  the  image  plane  there  is  no  need  for  waves  In  the  Fourier 

synthesis  of  wave  length  ^  Such  waves  can  only  aid  in 

the  description  of  detail  not  found  on  the  image  plane o  Thus 

the  Fourier  series  is  finite  and  we  say  the  picture  is  band 

limited  with  an  upper  frequency  limit  (cycles/cm.)  of  W  =  — ^ 

2  ^ 

Shannon's  sampling  theorem^^^  applied  to  two  dimensions  shows 
how  the  picture  can  be  completely  described  from  a  knowledge 
of  the  intensities  at  the  points  of  a  finite  square  grid  of 
separation  spacing  The  information  at  the  intersections 

on  this  grid  may  replace  the  input  plane  of  a  spatial  filter. 
This  result  correlates  well  with  practical  considerations 
since  a  scanning  mechanism  will  no  doubt  be  used  to  record 
the  input  data  and  such  mechanisms  are  obviously  limited  to 
so  many  lines  per  mm.  Similar  arguments  show  that  the  output  . 
plane  may  be  replaced  by  a  grid  of  mesh  no  finer  than  that  of 
the  input  grid, 

1,3  THE  NEIGHBORHOOD  MODIFICATION  PROCESS 

Assuming  that  the  input  and  output  grids  are  superimposable , 

the  general  form  of  the  spatial  filter  can  be  derived  from  a 


/•w  simple  considerations.  First  of  all  slacewe  are  interest¬ 
ed  in  the  detection  of  a  localized  object,  positional  infor- 
mation  should  not  be  loot  by  the  filtering  action.  This  is 
assuming,  of  course,  that  the  size  of  the  target  is  relatively 
small  in  comparison  to  the  whole  field  of  view.  Secondly,  any 
statistical  description 'Of  the  background  will  show  that  cor¬ 
relation  holds  for  relatively  small  distance,  again  in  compari¬ 
son  to  the  whole  field.  By  this  we  mean  that  the  intensity 
level  at  one  place  on  the  image  plane  is  not  highly  correlated 
with  that  at  another  place  if  they  are  very  far  apart.  These 
statements  will  be  made  more  precise  in  the  section  on  filters 
where  mathematical  notation'  i-s  introduced. 

Prom  these  two  observations  it  seems  clear  that  the  connection 
between  the  two  grids  should  be  as  follows;  The  intensities 
in  the  neighborhood  of  an  input  grid  point,  P,  should  alone 
determine  the  intensity  at  the  output  grid  point,  P* ,  where 
P*  corresponds  to  P  under  the  assumed  super imposability.  This 

■  •  a  • 

mapping  of  a  neighborhood  of  input  intensities  onto  an  output 
intensity  is  termed  the  "neighborhood  modification  process" 

« 

(N.MoP.),  The  size  of  the  neighborhood  should  be  large  enough o 


to  contain  the  target  as  well  as  distances  over  which  there 
is  significant  correlation  in  the  backgnound. 

Thus  the  total  filtering  action  can  be  described  by  a  function 
f(x)  where  x  is  an  N-tuple  vector  whose  components  are  the 
intensities  at  the  points  of  the  grid  ordered  in  some  way  in 
the  neighborhood  of  the  point,  P.  For  convenience  the  neigh¬ 
borhood  will  be  chosen  in  the  form  of  a  square  with  P  at  the 
center  and  the  points  will  be  ordered  from  left  to  right  and 
top  to  bottom.  Thus,  f(x)  =  y  ds  the  intensity  at  P*  on  the 
output  grid.  This  is  illustrated  in  Figure  3, 

1.4  MULTIPLE  WAVE  LENGTH  PROCESSES 

In  the  combustion  process  of  a  rocket  plume,  as  in  most  steady 
state  burnings,  there  is  a  definite  correlation  between  the 
infrared  emission  at  one  place  in  the  plume  and  the  visible 
emission  at  another  place.  This  cross-correlation  of  intensi- 
ties  from  different  wave  lengths  is  in  addition  to  correlations 
occurring  within  any  one  wave  length.  This  cross-correlation 
will  also  be  present  in  the  background  (earth  and  cloud  radia¬ 
tion)  and  thus  additional  discrimination  can  be  gained  by  the 
use  of  multiple  wave  length  spatial  filters.  The  incorporation 
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Figure  3 

Neighborhood  Modification  Process 
The  intensities  in  the  neighborhood  are  transformed  and 
mapped  to  a  single  point  in  the  output  plane. 


scheae  does  not  increase 


of  sore  than  one  wave  length  into  the 
the  theoretical  difficulties  by  very  much  but  in  the  iapleuenta- 
tion  it  would  undoubtedly  pose  additional  probleas.  For  in¬ 
stance.  one  optical  systea  aight  alternately  illuainate  two 
different  input  planes,  each  coated  with  a  photosensitive  aat- 
erial  responding  to  that  particular  wave  length. 

The  infrared  and  visible  radiations  tend  to  coapleaent  each 
other  as  to  resolution  and  intensity.  Host  of  the  pluae  radia- 

tion  is  in  the  I.R.  but  the  resolution  is  considerably  better 
in  the  visible. 

1.5  RETICLE  SYSTEMS 

There  is  a  collection  of  mechanical  devices  which  are  used  to 
detect  the  presence  of  a  point  source  against  a  background  of 
clouds,  etc.  These  reticle  systems,  as  they  are  called,  employ 
a  "chopping" action  which  usually  consists  of  a  collection  of 
straight  edge  stops  moving  normal  or  parallel  to  their  edges. 

This  motion  produces  an  off  and  on  switching  over  the  total 
field  according  to  some  pattern.  Figure  4  illustrates  a 
simplified  rotating  reticle.  Some  analysis  of  these  devices 
has  been  done  in  the  Fourier  and  Laplace  planes.  It  is 
clear  from  an  examination  of  the  image  plane  itself  that  their 
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.Figure  4 
Rotating  Reticle 


utility  liea^  in  the  ability  to  pulse  on  and  off  a  point 
source  intensity  while  at  the  same  time  tending  to  suppress 
such  pulsing  of  high  intensity  gradients  that  appear  along 
edges  whose  average  radius  of  curvature  is  not  too  small. 

The  detection  of  the  point  source  is  thus  reduced  to  that  of 
detecting  a  pulsation  in  the  integrated  output  intensity  from 
the  reticle o  A  clear  advantage  of  most  of  these  systems  is 
their  mechanical  simplicity.  Two  serious  defects  are  their 
inability  to  take  advantage  of  much  a  priori  knowledge  con¬ 
cerning  the  background  the  their  inability  to  supply  good 
positional  information  unless  high  scan  rates  are  used.  The 
generalization  of  reticle  systems  to  include  non  point 
sources  is  of  questionable  value  and  would  tend  to  reduce  to 
ordinary  template  matching.  The  spatial  filters  discussed 
in  this  report  can  utilize  considerable  a  priori  background 
information  and  detect  non  point  sources  as  well  as  point 

sources.  They,  of  course,  will  also  tend  toward  more  com¬ 
plexity. 


2.  DISCUSSION 


There  are  a  variety  of  ways  to  proceed  toward  the  design  of 
a  spatial  filter  and  all  of  these  depend  upon  a  priori  choices 
of  the  mapping  function  f (x) ,  the  optimization  criterion,  and 
the  background  and  target  descriptions.  For  each  filter  which 
is  discussed,  the  single* as  well  as  multiple  wave  length  cases 
*111  be  considered.  Also  the  targets  will  be  both  point  source 
and  non  point  source  types  with  the  latter  being  of  fixed 
shape  or  having  a  statistical  distribution  of  shapes.  The 
optimization  criterion  employed  is  that  of  maximum  transmitted 
peak  target  to  r.m.s*  background.  One  reason  at  least  for  using 
this  criterion  is  that  in  the  linear  filter  it  makes  use  of 
signal  shape'  whereas  other  criteria  such  as  Weiner's  do  not.^®^ 

The  free  variables  that  .are  left  after  the  above  constraints 
are  imposed  are  the  form  of  the  mapping  function  f(x)  and  the 
background  description.  These  have  a  complementary  relation¬ 
ship  such  that  if  the  mapping  function  is  completely  prescribed, 
as  in  a  polynomial  filter,  the  background  description  necessary 
is  fixed.  If  the  background  is  completely  described  statisti¬ 
cally,  as  in  the  general  filter,  then  the  form  of  f(x)  is  do- 
These  two  situations  as  well  as  a  decision  filter 
will  be  described  in  what  follows. 


2.1  POLYNOMIAL  FILTERS 

2.1.1  The  Linear  Filter 

In  <u>  elnctronic  U„.„  filter  .hone  input  and  output  are 
rolt.g.0.  th.  roltng.  nt  ti«  t^  i,  detenninod  b,  .  u„..r 
weighting  of  the  input  prior  to  time  t^.  Thin  continuoun 
weighting  is  perforued  by  nn  integral  operation.  If  a  finite 
discrete  weighting  is  desired  this  can  be  perfomed  by  a 
sus»ation  of  past  .slues  using  a  fixed  set  of  weighting  coef¬ 
ficients.  These  two  cases  are  shown  in  Figure  5.  This  opsra- 
tion  correspond,  to  either  a  delay  line  filter  or  to  a  sampled 
data  digital  filter.  In  the  case  of  spatial  filters  the  dis. 
Crete  weighting  procedure  ha.  co.plete  two  di«n.ion.l  freedo. 
in  contrast  to  the  restricted  one  di.ension.1  freedo.  of  th. 
previous  filter. 

Figure  6  illustrates  the  necessary  and  sufficient  form  of  f(x) 
in  order  that  the  spatial  filter  be  linear  for  the  ease  of  a 
5  X  5  neighborhood.  A  linear  filter  is- one  in  .-hich  a  linear 
combination  of  inputs  produces  as  an  output  the  same  linear 
combination  of  the  corresponding  individual  outputs.  The  ‘ 
8ignal-to-noise  ratio  for  the  linear  filter  is: 


tinuous  Weighting 


o 
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It  is  simpler  to  maximise  the  square  or 


(1) 


(2) 


where  the  summations  are  carried  out  over  the  neighborhood 
square  and  the  bar  denotes  an  average  carried  out  over  an 
ensemble  of  inputs.  The  c^  are  the  unknown  weighting  coef¬ 
ficients,  the  s^  are  signal  (target)  values,  and  the  n^  are 
noise  (background)  values.  Here  the  signal  is  assumed  to  have 
a  known  shape  and  in  fact  peaks  at  the  point  P,  the  center 
of  the  neighborhood.  The  denominator  of  R  can  be  written  as 
ij  where  =  n^n^  is  the  autocorrelation  of  the 

background .  ^  =  is  a  positive  semidefinite  matrix. 

So  far  two  averages  are  indicated,  n^  =  and  n^  »  . 

so  that  some  comment  concerning  the  ensemble  mentioned  above 
IS  in  order.  All  of  the  spatial  filters  treated  in  this  re¬ 
port  are  potentially  adaptive.  This  means  that  the  parameters 
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inrplTed  in  the  napping  function  f(x)  fin  the  linear  case  the 
can  be  changed  from  time  to  tine  as  new  infornation  about 
the  background  is  acquired.  This  involves  an  assumption  of 
stationarity  in  the  background  statistics  over  a  period  of 
time  which  includes  that  time  used  in  recording  an  ensemble 
of  background  inputs  and  taking  appropriate  averages  as  well 
as  the  time  during  which  the  filter  uses  this  information  to 
enhance  its  detection  ability.  Such  an  assumption  is  always 
inherent  in  any  statement  concerning  the  use  of  a  priori  back¬ 
ground  data.  If  a  sharp  discontinuity  in  the  background  statis¬ 
tics  occurs  at  some  time  then  it  is  clear  that  stationarity 
.does  not  hold  and  any  adaptive  system  would  tend  to  have  a 
"blind  spot"  for  a  short  time  unless  corrected  by  some  ad¬ 
ditional  memory  scheme.  Such  a  thing  would  be  partly  true  for 
any  human  observer  and  so  the  difficulty  cannot  be  completely 
overcome. 

For  the  present  linear  case  there  are  devices  that  will  com¬ 
pute  the  autocorrelation  of  a  two-dimensional  input  and 
which  involve  no  moving  parts,  Eppler  and  Darius^®^  have 
devised  such  an  autocorrelator  which  is  shown  in  Figure  7, 

The  A  and  B  are  two  transparencies  of  the  same  picture  where 
A  is  uniformly  illuminated  from  the  left  so  that  the  intensity 


reaching  point  P  on  screen  C  by  way  of  B  is  proportional  to  the 
autocorrelation  for  the  vector  OP.  This  depends  upon  two 
things  s 

(a)  The  spacings  "a**  and  "b**  must  be  such  that  the 

scaling  ratio  between  transparencies  is  . 

D 

(b)  The  geometrical  fact  that  all  straight  lines  in¬ 
tersecting  the  set  of  three  parallel  planes  A,  B, 

C  are  cut  proportionately. 

( 7) 

One  shortcoming  of  this  one  as  well  as  others  is  that  they 
need  at  least  one  transparency  of  the  input  for  their  opera¬ 
tion.  If  this  obstacle  can  be  overcome  the  optical  auto¬ 
correlator  would  appear  to  be  a  fairly  simple  device.  Digital 
computer  methods  for  computing  autocorrelation  have  been  found 
to  be  easily  instrumented  and  are  available  as  an  alternate 
method. 

Returning  to  equation  (2)  in  an  attempt  to  maximize  the  ratio 
R,  there  are  two  exceptional  cases  that  should  be  mentioned 
first.  There  might  conceivably  exist  a  non  trivial  set  of  c^ 
such  that  j  -  which  would  imply  of  course  that 

^  is  not  a  positive  definite  matrix.  In  such  a  case  the 
"best"  choice  of  the  c^  would  be  one  of  this  type.  Another 


exceptional  case  is  for  R  to  be  unbounded  for  non  trivial  c  : 

i 

here  the  choice  on  the  C|  is  unlimited.  It  is  assumed  in  what 
follows  that  the  exceptional  eases  do  not  occur  so  that  5 
a  positive  definite  Mtrix  and  an  absolute  maximum  for  R  must 
be  faund  among  the  solutions  to  the  equations. 


Br 

a^^n 


0 
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1.  2. 


•  •  • 


(3) 


This  procedure  is  carried  out  in  Appendix  A  with  the  resulting 
vector  solution. 

I  •  «  5:  (4) 

The  k  is  the  coefficient  column  vector  consisting  of  the 
critical  c^^,  the  has  components  =  s^  * ^  and  "a” 
is  an  arbitrary  constant.  Since  J  is  assumed  positive  def¬ 
inite  it  is  non  singular  and 

k  =  al'^£  (5) 

It  is  shown  in  Appendix  A  that  a  necessary  and  sufficient  con¬ 
dition  that  this  represent  an  absolute  maximum  is  that  the 

positive  semi -definite.  Here  oC  =  CT* 
the  transpose. 

only  one  value  of  R  since  R  is  a  homo- 
Cj^  of  degree  zero. 


matrix  ^  be 

^  and  primes  denote 

Solution  (5)  represents 
geneous  function  of  the 


In  the  ease  of  a  point  source  all  of  the  are  zero  but  the 
central  value  corresponding  to  the  center  of  the  neighborhood 

7) 

square.  Thus  the  size  of  the  square  in  this  case  is  deternined 
solely  by  background  correlation.  Independent  of  the  fact  that 
a  conplete  specification  of  the  forn  of  f(x)  =  c’x  deiaands  a 
knowledge  of  the  autocorrelation  J  of  the  background,  we 
can  speak  of  a  statistical  measure  of  background  correlation. 
Such  a  measure  is  necessary  if  one  is  to  choose  the  size  of 
the  neighborhood  square  in  an  intelligent  way. 

Let  p^(a:x,b)  denote  for  the  background  the  probability  that 
given  the  free  vector  x  with  the  intensity  at  its  initial 
point  in  the  interval  between  b  and  b  ♦  db,  that  the  intensity 
at  the  terminal  point  is  in  the  interval  between  a  and  a  ♦  da. 

If  P2(c)  denotes  the  probability  that  the  intensity  at  any 
point  is  in  the  interval  (e,  c  +  dc),  then  it  is  clear  that 

^  ^2^*^  for  all  a  and  b  indicates  no  correlation  for 
the  vector  separation  x.  For  each  direction  there  would  be 
a  smallest  length  vector  satisfying  the  above  relation  and  the 
collection  of  these  vectors  will  prescribe  the  smallest  neighs 
borhood  square  necessary  to  take  full  advantage  of  the  correla¬ 
tion.  In  the  present  linear  filter  case  as  well  as  in  other 


o 


o 


non  linear  cases  a  simpler  measure  of  the  loss  of  correlation 

is  for  n^n^  "  ‘^i j  ^  sufficiently  separated 

points  i  and  J. 

To  incorporate  multiple  wave  lengths  into  the  present  scheme 
would  no  doubt  require  a  considerable  addition  in  instrumenta¬ 
tion  but  for  the  theoretical  formalism  only  a  -minor  notational 
.  change  is  needed.  Let  us  consider  for  simplicity  the  ease  of 
two  wave  lengths  and  Ag*  Of  course  one  way  to  do  this 
would  be  to  design  two  separate  linear  filters  and  consider 
their  output  in  a  parallel  fashion.  The  drawback  here  would  be 
that  no  use  is  made  of  cross-correlation  in  the  two  wave  lengths. 
So  we  will  proceed  as  follows:  Let  the  index,  i,  take  on 
values  from'  1  to  2N  where  N  =  m^  denotes  the  number  of  points 
in  a  neighborhood  square  of  ra  by  m  size.  The  values  i  .  1  .  . 

N  will  refer  to  wave  length  Aj  and  the  values  i  =  N  +  1  .  .  . 

2N  will  refer  to  wave  length  Ag*  Such  an  arrangement  is  both 
necessary  and  sufficient  for  the  filter  to  be  linear  over  both 
wave  lengths.  Thus  for  example  for  i  =  1  and  j  =  N  -f  2 

gives  the  cross-correlation  between  the  intensity  in  A  at 
position  one  and  the  intensity  in  A g  position  two.  It 
should  be  noted  that  the  output  of  this  filter  is  still  in  a 
single  plane  grid. 


2«1»2  The  Quadratic  Filter 

A  polynomial  filter  somewhat  more  general  than  the  previous  one 
IS  a  homogeneous  quadratic  one«  In  this  case  f(x)  =  x'Ax  is 
a  quadratic  form  having  a  symmetric  matrix  A.  If  the  same 
optimum  detection  criterion  as  was  applied  to  the  linear  fil¬ 
ter  is  applied  to  this  one,  the  following  ratio  stands  to  be 
maximized s 

“  =  ~  (6) 

.n.n  J 

ff  ij  1  j 

The  subscript  indices  take  on  values  from  1  to  N,  i.e.,  the 
numbering  of  points  in  a  neighborhood  square.  The  a. .  are 
elements  of  the  matrix  A  while  CT  and  n.  are  the  signal  and 
noise  intensities,  respectively.  The'  bar  indicates  an  average 
over  an  ensemble  of  noise  backgrounds.  To  maximize  R  the  a 

ij 

must  satisfy  the  following  equations; 

-^“ij^ijkl  =  **^k^l  f®"*  1  =  1,  0  o  ,  N  (7) 

^ijkl  "i"j“k“l  represent  a  correlation  in 

the  noise  at  a  quartet  of  points,  i,  j,  k,  1  as  distinct  from 
the  autocorrelation  =  n^n^  which  represents  a  correlation 

in  the  noise  at  a  pair  of  points,  i,  j.  Since  there  are 


equations  in  the  unknowns  It  is  olear  that  in  general 
there  will  be  a  unique  set  of  a^^  determined  and  hence  a 
unique  function  f.  The  "b"  is  an  arbitrary  constant  whieh 
appears  for  the  same  reason  as  "a"  in  equation  (4),  nam»ly 
because  the  ratios  to  be  maximised  are  homogeneous  of  degree 
sero. in  the  unknown  constants. 

The  adaptation  of  the  quadratic  filter  to  a  point  source  and 
multiple  wave  lengths  proceeds  along  lines  exactly  parallel 
to  those  of  the  linear  filter. 

If  one  proceeds  to  non  homogeneous  polynomials  or  to  ones 
of  higher  degree  an  increased  knowledge  of  correlation  in 
the  noise  is  demanded.  A  more  general  and  satisfying  ap¬ 
proach  is  to  assume  a  somewhat  complete  knowledge  of  correla- 
tion  in  the  noise  and  then  proceed  to  find  the  function  f 
which  will  maximize  the  signal-to-noise  ratio, 

2.2  A  GENERAL  STATISTICAL  FILTER 

For  any  ensemble  of  noise  backgrounds  consider  the  following 
function  “\/(n)  ,  where  the  vector  n  denotes  a  noise  intensity 
distribution  over  a  neighborhood  square  (m  by  m) .  The  image 


plane  has  been  replaced  by  an  input  grid  so  that  the  vector  n 
cones  from  a  Euclidean  N*>space  (N  =  m^) .  The  function  "V'is 
defined  as  a  probability  density  function  for  the  random 
variable  vector  n.  This  means  that  the  integral 

A  /"‘a  ‘  ’  ‘/•n  ^  ^  ^2  •  •  •  “  4  *»> 


gives  the  probability  that  an  arbitrary  neighborhood  of  an 
arbitrary  input  grid  taken  from  the  ensemble  will  have  a  dis¬ 
tribution  of  intensities  satisfying  the  following  inequalities. 
*i  ^  for  i  =  1  .  .  ,  N 

where  n 


The  above  integral  will  be  abbreviated  by  ^  J. 

and  it  is  clear  that  /-V{±)A±  =  1  with  infinite  limits 
understood.  The  size  of  the  neighborhood  square  is  dictated 
by  the  area  over  which  the  noise  has  ’'significant”  correla¬ 
tion  properties.  In  practice  where  the  ensemble  is  somewhat 


restricted  the  function -V  can  always  be  computed.  In  fact, 
regardless  of  the  si;e  of  the  ensemble  the  computed  function 


represents  the  most  complete  statistical  knowledge  consistent 
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with  that  size.  It  should  be  noticed  that  from  the  function  "V 
such  functions  as  the  autocorrelation  and  four  point  correla- 
tion  descriptions  of  the  nol&e  can  be  derived^ 


Analogous  to  the  function  -y(n)  another  probability  density 
function  cr(s)  will  be  introduced.  Assuming  that  the  neigh> 
borhood  square  is  large  enough  to  contain  the  localized  signal 
as  well  as  noise  correlation,  <r(8)  will  describe  the  prob¬ 
ability  with  which  the  signal  takes  on  various  orientations  as 
well  as  positions  in  the  square.  If  several  different  signals 
are  involved  CT  can  incorporate  these  as  well.  More  precisely 
the  integral 

f  "i  f\ 

K,  y .  •  •  y .  "  ?i  "fii  •  •  •  ■‘fH 

1  2 


gives  the  probability  that  an  arbitrary  neighborhood  containing 
a  signal  superimposed  on  an  average  noise  background  will  have  ' 
a  distribution  of  intensities  satisfying  the  following  inequali¬ 
ties. 


ai  <  Si  <  b. 


for  i  =  1,  .  ,  .  N 


(10) 
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With  the  above  definitions  of  -j/  and  <r  it  is  clear  that  the 
average  transmitted  signal  and  the  average  square  of  the 
transmitted  noise  are  respectively 

and  -V(±)  d  2_  (11) 

Thus  the  ratio  to  be  maximized  is 


( trijpi 

^  V(j.)d  i. 

and  the  variational  calculus  is  employed  in  Appendix  B  to  find 
the  unknown  function  f.  Assuming  to  be  the  function 


sought  the  desired  result  is 

.  A  ffff- 


(13) 


with  A  an  arbitrary  constant.  The  ratio  in  Equation  (13)  is 
intuitively  correct  since  any  intensity  distribution  in  a 
neighborhood  square  centered  at  point  P  which  has  a  much  higher 
probability  of  representing  a  signal  than  noise  will  give  a 
strong  output  intensity  at  P' . 


In  thinking  of  <T  (_^)  as  a  scaler  point  function  in  an  N 
dimensional  space,  the  point  source  target  case  corresponds 
to  a  (T  which  is  only  sizable  for  points  in  the  space  which 
have  a  large  central  coordinate  value  for  2L. 
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Thus  the  computation  of  O'  for  the  point  source  case  de¬ 
pends  only  upon  the  average  background  and  the  statistical 
distribut.ion  of  amplitudes,  of  the  source o 

The  multiple  wave  length  case  can  be  handled  in  a  way  similar 
to  that  of  the  linear  filter^  For  the  ease  of  two  wave  lengths 
the  dimension  of  the  vector  ^  is  doubled  so  that  the  first 
half  of  the  coordinates  refer  to  intensities  in  and  the 
second  half  to  intensities  in  The  V  and  er  functions 

then  incorporate  all  of  the  cross^correlation  of  intensities  . 
in  A  j  and  Ag* 

Any  instrumentation  or  simulation  on  a  computer  of  this  kind 
of  a  statistical  filter  requires  a  large  metaory  capacity.  Ways 
of  obtaining  approximate  and  useful  estimates  of  the  -y  and  a~ 
functions  are  at  the  present  time  being  worked  out.  The  filter 
is  somewhat  of  an  ideal  to  be  approached  and  also  shows  how  the 
mapping  function  f(x)  is  completely  determined,  i.e.,  to  with¬ 
in  a  multiplicative  constant,  when  the  background  description 
is  given. 

2.3  A  DECISION  FILTER 

Using  the  same  ~V  and  (7"  functions  as  previously  defined.  It 
will  be  interesting  to  show  how  an  entirely  different  approach 
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to  the  detection  problen  can  lead  to  a  result  strikingly  simi¬ 
lar  to  that  obtained  in  Section  2.2.  Middleton^®^  in  applying 
('9) 

the  work  of  Wald  to  the  detection  of  signals  in  noise  in¬ 
troduces  a  decision  function  §(d^s_^).  Here  (d^s^)  Ki^es 
the  probability  that  no  signal  is  present  in  a  neighborhood 
square  given  the  intensity  distribution  J_.  Similarly 

^^*®  probability  that  a  signal  is  present. 

From  the  definition  it  follows  that 

JCdo-'J.)  ♦  =  1  (14) 

The  optimum  criterion  to  be  used  in  designing  the  decision 

f®  called  that  of  an  ideal  observer^^^.  This  criterion 
minimisses  the  total  average  error  of  which  the  filter  is 
capable.  There  are  two  types  of  errors.  One  of  them,  called 
the  false  alarm,  gives  an  output  of  signal  present  when  no 
signal  is  present.  The  other,  called  the  miss,  gives  the 
output  of  no  signal  present  when  a  signal  is  present.  The 
average  probabilities  of  these  two  types  will  be  denoted  by 
and  o^p,  respectively.  Here  p  and  q  denote  the  a  priori 
probabilities  of  the  presence  or  absence  of  a  signal  with 
p  ♦  q  =  1.  Thus  oC  gives  the  average  conditional  probability 
of  a  miss  while  ^  gives  the  average  conditional  probability 


of  a  false  alarm.  The  relations  giving  OC  and  jS  in  terms  of 
the  decision  function  S  and  the  density  functions  Vand  ^ 
are  as  follows; 

^  *  ycr(^)  S(d^:^)  d 

S  (dj^s  d  ^ 

The  ideal  observer  then  seeks  to  minimise  the  total  average 
error  T  *  P  ♦  ^  q  • 


T  «  P  ya-(^)  §(d^:_^)  d  ^  ♦  q  jCi^)  S  d  ^ 

(16) 

Using  Equation  (14)  the  total  average  error  becomes 

T  =  p  *  -  P<r(^)]  §(dj;^)  d  ^  (17) 

The  choice  of  S  which  minimises  T  is  clearly  the  one  which 
has  ^  =  0  when  the  integrand  is  positive  and  has  ^  s  1  when 
the  integrand  is  negative.  Thus 


or 


qy(^)  -  pr(V)4  0 
i*'?  ‘[2  qV(^)  -  p<7-(^)  >0 

in  more  familiar  terns , 


(18) 


(19) 


In  the  language  of  decision  theory  the  space  of  inputs  has  been 
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partitioned  into  two  parts  such  that  a  yes  or  no  decision  can 
be  associated  with  each  one.  Co-paring  this  result  with  that 
of  Equation  (13)  it  can  be  seen  that  for  A  =  the  filter 
of  Section  2.2  becoaes  a  decision  filter  where  an  Intensity 
at  P'  greater  than  one  is  interpreted  as  a  signal  present  in 
the  neighborhood  square  about  P.  If  the  intensity  is  less 
than  one  no  signal  is  assuaed  present. 


The  adaptation  of  the  decision  filter  to  a  point  source  and  to 
Bultiple  wave  lengths  is  exactly  the  same  as  for  the  general 
statistical  filter.  The  siallarlty  of  the  final  fora  of 
f(x)  for  the  last  two  filters,  considering  their  divergent 
approaches,  is  somewhat  remarkable  and  tends  to  give  con¬ 
siderable  weight  to  that  fora  of  the  function. 

2.4  COMBINING  FILTERS 

Each  filter  which  is  designed  is  designed  for  the  purpose  of 
distorting  the  optical  pl^e  data  in  some  certain  desirable 
way.  The  linear  filter  does  this  by  improving  the  signal-to- 
noise  ratio  using  signal  description  and'noise  autocorrelation. 
The  decision  filter  evaluates  the  ratio  of  the  probabilities 
of  the  input  having  occurred  in  .the  signal  to  its  having  oc¬ 
curred  in  the  noise.  An  infinitude  of  filters  can  be  designed 

which  require  statistical  descriptions  lying  somewhere  between 
these  extremes. 
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Deelsiona  can  be  made  as  to  the  probability  of  a  target's 
presence  or  absence  in  the  input  optical  plane  dependent  on  the 
filter  output.  The  decision  filter  makes  this  decision  in  its 
operation.  Various  types  of  decision  criteria  may  be  used.  For 
example,  the  output  intensity  of  any  filter  may  be  partitioned 
at  some  level,  all  levels  above  the  partitioning  point  indicat- 
ing  signal  presence,  all  points  below  indicating  absence. 

Filters  which  have  a  two-level  output,  indicating  presence  or 
absence,  are  called  property  filters.  Their  effect  is  to  di¬ 
vide  the  universe  of  possible  patterns  into  two  classes,  the 
0  class  and  the  1  class.  Property  filters  as  applied  to  the. 
thesis  problem  would,  for  example,  include  measurements  of  the 
prepence  or  absence  of  the  target  in  several  spectral  ranges. 

Property  filters  may  be  designed  on  any  reasonable  basis.  Their 
evaluation  may  of  necessity  be  empirical.  The  term  as  used  here 
means  simply  that  the  filter  measures  some  property  of  the  image 
and  votes  accordingly. 

Several  decision  processes  may  be  combined  to  enhance  the  ac¬ 
curacy  of  the  overall  decision  process.  It  is  probably  easier 
to  find  a  large  set  of  properties,  each  af  which  gives  a  little 
useful  information  about  the  target,  than  to  find  a  few  which 


art  just  righto  This,  is  sapetially  sa  in.  the.,  adaptive  aystemo 

Two  basic  ways  of  combining  decision  processes  are  illustrated 
in  Figure  8o  The  serial  decision  requires  excellent  discrimi¬ 
nation  ability  at  each  step. 

s 

The  serial  scheme  (a)  minimizes  false  alarms  at  the  expense  of 
permitting  misses o  The  serial  scheme  (b)  minimizes  misses  at 
the  expense  of  permitting  false  alarms o  Either  case  would 
probably  be  intolerable  in  a  system  in  which  the  machine  has  the 
final  sayo  If  the  final  decision  were  human,  scheme  (b)  would 
be  better  as  it  minimizes  misses »  The  parallel  decision  (c) 
does  not  require  such  tight  requirements  on  the  individual  de¬ 
cisions  but  requires  that  they  be  combined  by  some  good  method 
in  the  final  decision  block.  By  good,  it  is  meant  that  the 
outputs  of  highly  discriminating  decision  blocks  are  weighted 

heavily  while  less  reliable  decision  blocks  are  weighted  light¬ 
ly. 


Each  filter  has  associated  with  its  decision  ability  a  prob¬ 
ability  value  of  its  decision  being  correct  or  incorrect.  Th® 
purpose  of  a  design  which  retains  some  residual  adaptation  is 
to  improve  the  reliability  of  the  decision  as  time  goes  on. 
Thus,  the  autocorrelation  function  in  the  linear  filter  is 
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No  Target  Target  Target  No  Target 

Simple  Serial  Decision  Schemes 


Decision  of  Target  Decision  of  Target  (c) 

Absence  Presence 

Parallel  Decision  Scheme 

Figure  8 

Decision  Filter  Combinations 


continually  updated  so  that  it  remains  representative  of  the 
noise.  It  seems  clear  that  the  property  filters  used  in  a 
detection  scheme  should  be  designed  with  the  problem  con¬ 
straints  in  mind. 

The  problem  of  designing  a  system  which  uses  these  concepts 
breaks  itself  rather  naturally  into  two  parts.  The  first  is 
to  design  property  filters  which  are  in  some  sense  optimum  and 
which,  again  in  some  sense,  retain  residual  adaptability. 

The  second  is  to  use  the  decisions  made  by  these  property 
filters  so  as  to  improve  the  overall  decision  ability  of  the 
system.  Elach  filter  has  four  probabilities  associated  with 
it;  Pjj  the  probability  that  it  decides  on  a  target  being 
present,  given  that  a  target  is  present.  the  probability 

that  It  decides  on  a  target  being  present,  given  that  a  target 
is  not  present..  P^j  .the  probability  that  it  decides  against* 
a  target  being  present,  given  a  target  is  present.  P^^  the 
probability  that  it  decides  against  a  target  being  present, 
given  a  target  is  not  present.  Ideally,  P^^  and  P^^  are  each 
of  value  1  and  P^j  and  are  zero.  The  adherence  to  this 
optimum  is  the  goal  of  the  filter  design.  If  this  were  possible 
foT  a  single  filter  then  there  would  be  no  need  of  using  more 


than  one. 


The  decision  filter,  sec  2.3,  is  a  single  filter  which  minimizes 
miss  and  false  alarm  decisions,  and  Therefore  it  is 

ideal.  The  quantity  of  data  which  it  must  keep  available,  how¬ 
ever,  is  prodigious  and  this  is  a  limitation  which  may  not  be 
easily  eliminated.  For  example,  if  it  were  to  look  at  ten  se¬ 
quential  images  (time)  of  twenty  x  twenty  points,  and  each 
point  had  a  ten  level  number  associated*  with  it,  the  number 
of  possibilities  which  can  occur  and  which  must  be  accounted 
for  is  10^®°°,  i.e.,  4,000  points  taking  on  10  levels  inde¬ 
pendently.  It  is  obvious  that  some  compromises  must  be  made  in 
the  instrumentation  of  such  a  filter  which  will  tend  to  make 
it  less  than  ideal. 

The  filters  discussed  in  secs  2.1,  2,2,  2.3  of  this  report 
used  in  several  spectral  regions  are  optimum  in  the  sense  in 
which  they  were  deri^ved.  They  will  nevertheless  have  success 
probabilities  associated  with  them  and  if  the  design  criteria 
are  reasonable  their  success  probabilities  will  be  much  larger 
than  chance. 

Likelihood* estimates  can  be  assembled  from  the  P. .  probabili- 
ties  that  have  been  introduced.  These  are  being  considered 
at  the  present  time  as  a  method  of  combining  decision  filters 
in  parallel. 


2.5  SIMULATION 

In  order  to  develop  some  feeling  for  what  can  be  attained  by 
the  use  of  different  filtering  and  detection  criteria,  computer 
simulations  of  systems  using  these  criteria  are  being  established. 
This  is  a  two  purpose  program.  First,  to  see  how  well  the  math¬ 
ematics  applies,  and  second  to  obtain  some  idea  of  the  requisite 
complexity  of  a  system  using  these  methods. 

The  hardware  being  used  in  the  simulations  is; 

1»  A  picture  scanner  (still  under  construction)  which 
partiUons  a  photograph  into  100  x  100  equally  spaced 
points  and  generates  a  number  for  each  point  propor¬ 
tional  to  the  average  intensity  over  a  circle  centered 
on  that  point.  The  circles  are  situated  so  that  they 
touch  but  do  not  overlap.  These  numbers  are  then  con¬ 
verted  into  a  ten-level  digital  code  and  placed  on  IBM 
magnetic  tape. 

2.  An  IBM  7090  high  speed  digital  computer  with  a  core 
storage,  of  about  30,000  words.  The  picture  contains 
10,000  computer  words  so  two  pictures  plus  program 
^ii  easily  into  the  core. 
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3.  An  output  device,  the  SC  4020  microfilm  recorder  which 
prints  a  100  x  100  point  picture.  The  technique  used 
is  to  partition  the  output  field  of  an  oscilloscope 
into  100  X  100  equal  squares  and  to  print  a  number  of 
dots  in  each  square  dependent  upon  the  grey  level, 
i.e.,  a  7-level  is  7  dots,  a  3  is  3  dots,  etc.  The 
SC  4020  recorder  utilizes  a  Characteron  shaped  beam 
tube.  The  output  oscilloscope  picture  is  photographed. 
Variations  of  this  approach  using  the  SC  4020  are  under 
investigation  to  see  what  improvements  can  be  made. 

The  manipulation  of  the  picture  data  is  dependent  upon  the  com¬ 
puter  programming.  Commensurate  with  the  computer  capabili¬ 
ties,  subroutines  have  been  (and  are  being)  written  so  that  new 
processing  ideas  can  be  checked  by  writing  a  driver  program 
which  calls  for  subroutine  as  required.  For  example,  the  auto¬ 
correlation  function  is  obtained  for  a  picture  by  calling 
AUTOCOR  and  stating  the  core  location  of  the  picture.  Print¬ 
outs  of  a  picture  at  any  stage  of  processing  is  also  available 
as  a  subroutine.  . 

A  numerical  picture  output  is  available  when  desired  by  using 
an  IBM  720  high  speed  printer,  A  picture  printout  subroutine 


has  been  written  which  gives  a  picture  output  of  up  to  100  x  100 
points  with  a  decimal  number  printed  at  each  point  to  represent 
the  level.  For  reasons  of  size  compression,  wc  have  been  nor¬ 
malizing  and  quantizing  the  output  into  grey  levels  of  zero  to 
nine . 

2»5.1  Autocorre lation 

The  autocorrelation  program  evaluates 

0(R)  =  -^-SF(r)F  (r+R) 

where  R  is  the  vector  separation  between  points  F(r)  and 
F(r+R),  the  intensities  at  the  sampling  points,  and  M  is  the 
number  of  such  points  considered,  r  is  a  dummy  scanning  vec¬ 
tor.  0(R)  is  a  measure  of  the  statistical  influence  of  the 
value  at  one  point  in  a  picture  upon  the  value  at  a  point 
separated  by  a  vector  distance  R. 

Since  (r+R)  does  not  necessarily  lie  in  the  picture  matrix, 
the  computation  prescribed  is  evaluated  by  considering  only 

those  values  of  r  which  permit  R  to  terminate  on  points  in 
the  matrix.  . 

4 

The  autocorrelation  function  at  the  origin,  R  =  is  the 
average  square  of  the  function.  For  arguments  larger  than  the 


distance  over  w^ioih  one  point  has  j.nfluence  over  another,  the 
autocorrelation  function  approaches  the  s^juare  of  the  average 
value  of  the  functiono 

Figures  9  and  10  are  examples  of  the  autocorrelation  function. 
The  "a”  picture  is  the  input,  F(r) ,  The  "b”  picture  is  the 
autocorrelation  function,  0(R) , 

Figure  9,  a  number  4  and  its  autocorrelation  function, demon¬ 
strates  two  properties  of  the  autocorrelation  function.  One, 
the  autocorrelation  function  is  independent  of  translation 
of  the  image  being  transformed.  Two,  although  structural 
information  is  lost,  the  distance  over  which  correlation  is 
important  can  be  inferred  from  the  image  data.  Note  the  strong 
correlation  in  the  direction  of  the  three  lines  making  up  the 
input  image. 

Figure  10,  numbers  between  0  and  9  taken  from  a  random  number 
table  £uid  its  autocorrelation,  illustrates  the  lack  of  sta¬ 
tistical  influence  on  one  point  over  another  when  each  point 
is  completely  independent, 

2,5,2  Neighborhood  Modification 

The  neighborhood  modification  linear  weighting  program  performs 
a  linear  weighting  of  the  picture  intensity  values  on  the  input 
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a)  The  oiuneral  4 


b)  Autocorrelation  of  the  numeral  4,  The  center 
value,  shown  as  zero,  is  a  ten  level.  The 
other  levels  are  normalized  to  the  center  level. 


Figure  9 

Autocorrelation  of  Numeral  4 
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a)  Ten  level  randon  noise. 


b)  Autocorrelation  of  (a).  The  center  value, 
shown  as  zero  is  a  ten  level.  The  other 
levels  are  normalized  to  the  center  level. 


Figure  10 

Autocorrelation  of  Random  Noise 
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plane  %n  a  square  of  up  to  55  x  25  points  and  plots  the  sum 
at  a  point  in  output  plane  at  a  point  corresponding  to  the 
center  o/  the  square.  It  does  this  for  every  possible  center 
on  a  picture  of  size  up  to  100  x  100  points.  See  Figure  3. 
Linear  weighting  of  this  type  has  a  direct  analogy  with  the 
one-dimensioaal  linear  filter.  Since  values  on  all  sides 

of  the  point  being  processed  are  available,  zero  phase  shift 
filters  are  possible, 

2,5»3  Equivalence 

The  equivalence  subroutine  equates  vector  relations  between 

points  to  matrix  relations  between  entries. 

■ 

• 

For  example,  the  autocorrelation  between  points  with  a  sepa¬ 
ration  of  one  horizontal  element  in  the  image  plane,  0(1,0), 
i»  the  same  for  all  i  and  j  points  of  the  neighborhood  with 
this  separation.  There  are  2n(n-l)  +  1  free  and  different 

autocorrelation  vectors  that  will  fit  into  an  nxn  neighbor¬ 
hood  square. 


Watrix  Inversion 

The  matrix  inversion  subroutine  solves  the  matrix  equation, 


a  0“ 
n 


i 
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In  the  filter  simulation,  are  the  filter  constants  to  be 
found,  is  tbs  autocorrelation  of  the  noise  betveen  points 

1  and  n,  and  is  the  target  description  plus  sverage  noise. 

This  is  followed  by  a  filter  design  subroutine  which  places  the 
k's  in  the  neighborhood  wodification  schewe. 

2.5.5  The  Linear  Filter  Siwufation 

The  optiwuw  filter  according  to  the  criterion  given  in  2.1 
was  simulated  as  shown  in  Figure  11,  with  exawplss  shown 

in  Figures  12  and  IS. 

Pictures  of  wise  Si  s  28  wleasnts  were  read  into  the  computer 
by  band.  100  x  100  element  pictures  will  be  read  in  when  tbs 
photo  scanner  is  completed.  The  program  solves  for  an  optimum 
filter  from  noise  and  signal  data. to  improve  the  signal«to> 
noise  ratio  for  a  signal  ♦  noise  picture. 

The  output  are  normalized  to  the  highest  picture  value  and 
quantized  into  levels  0  to  9. 

This  filter  is  easy  to  instrument  and  an  inexhaustive  variety  of 
modifications  of  the  general  idea  are  also  poaaibie  without 
requiring  much  addition.  1»  the  detection  filter  maintenance 
of  target  shape  was  not  included  as  a  constraint.  Thus,  its 
only  goal  in  life  is  to  peak  up  ac  the  pgint  when^it  has  a 
target  centered  in  the  neighborhood.  It  is  seen  to  do  this. 
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Figure  11 

Computer  Flow  Chart  for  the  Simulation  of 
the  Linear  Filter 
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a)  Dumbbell  noise  with¬ 
out  target 


b)  Dumbbell  noise  plus 
target 


c)  Dumbbell  noise  plus 
target  filtered  by  a 
3x3  neighborhood. 


d)  Dumbbell  noise  plus 
target  filtered  by  a 
5x5  neighborhood. 


Figiu*e  12 

Simulated  Linear  Filter  (Case  1) 
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a)  Random  numbers  plus 
target 


b)  Random  numbers  plus 
target  filtered  by  a 
3x3  neighborhood 


c)  Random  numbers  plus 
target  filtered  by  a 
5x5  neighborhood 

Figure  13 

Simulated  Linear  Filter  (Case  2) 


The  presence  of  a  target  on  the  inage  plane  can  be  deternined 
by  either  man  or  machine.  This  filter,  which  does  not  maintain 
structure  of  the  target  is  probably  better  when  using  machine 
interrogation  since  human  decision  is  generally  based  on  struc¬ 
ture.  This  will  be  investigated  further  when  real  data  become 
available . 

The  target  used  was  the  same  in  the  two  cases  which  have  been 

run.  It  consists  of  a  3  x  3  square  with  a  center  value  of 

level  9  and  peripheral  values  of  level  4,  i.e., 

444 

494 

444 

It  was  placed  in  the  same  location  in  both  cases,  i.e.,  centered 
on  the  13*  point  from  the  right  and  the  91*  point  from_the  bot¬ 
tom  of  the  signal  plus  noise  input  picture. 

For  Case  1,  Figure  12,  the  noise  considered  is  dumbbell  shaped. 
The  dumbbells  are  oriented  at  an  angle  of  135*  with  respect  to 
the  positive  x  axis.  Bach  dumbbell  consists  of  two  point  source 
intensities  of  10  level  separated  by  a  distance  of  2  o(.~^  (  oC  is 
the  spacing  of  the  elements  in  the  image  plane).  The  dumbbells 
were  placed  in  the  image  picture  according  to  a  random  number 
table  so  that  they  occurred  at  a  relative  frequency  of  1  in  20. 
Thus,  on  the  average,  one  element  in  ten  is  sampling  one  of  the 


sources . 
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In  the  computer  simulation  we  let  oC  =  1,  and  the  vector 
correlation  between  points  is  expressed  as  0  (o<,y6)  where 
is  the  horizontal  separation  and  is  the  vertical  sepa¬ 
ration  between  points.  The  correlation  0  (0,  0)  was  computed 
to  be  10,40  and  0  (-2,  2)  was  computed  to  be  5.67.  These 
agree  quite  well  with  the  values  estimated  for  a  larger  en¬ 
semble  of  input  noise  where  these  values  are  easily  seen  to 
be  10  and  5,  respectively.  Other  correlation  values  were 
found  which  were  higher  than  the  expected  limiting  values  of 
1,  (square  of  the  average  value  of  the  input),  the  highest 
being  0  (2,  2)  =  2,27,  These  values  ranged  from  ,21  to  2,27 
and  were  used  in  the  simulation  as  is  instead  of  assuming 
their  limiting  values.  Thus  the  effect  of  having  insufficient 
noise  data  to  work  from  entered  the  simulation  in  a  rather 
natural  way. 

Figure  12a  is  a  picture  of  the  noise  input.  It  is  normalized 
to  the  highest  value,  a  20  caused  by  two  points  overlapping. 
Figure  12b  is  the  same  picture  with  the  addition  of  the  tar¬ 
get,  Figure  12c  is  the  signal  plus  noise  picture  after  filter¬ 
ing  using  a  3  X  3  neighborhood  in  which  the  neighborhood  values 
as  found  by  the  computer  are 
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This  seens  intuitively  correct.  Values  along  the  135*  where 
noise  correlation,  in  high.,  are.  deeiiq>haaize<L. 


Figure  12d  is  the  same  as  12c  except  that  the  neighborhood 
is  5  X  5  with  values  of 
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These  neighborhood  values  again  sees  intuitively  correct  since 
they  show  deemphasis  along  the  axis  of  the  dumbbell  noise. 


The  intensity  over  the  target  is  seen  to  peak  for  both  filters. 

Figure  13  (Case  2)  is  the  result  of  a  second  simulation  of  this 
filter  using  a  different  noise  background.  The  noise  in  this 
case  has  amplitudes  given  by  a  rectangular  probability  density 
distribution  of  values  between  0  and  9.  The  values  at  the 
points  are  completely  uncorrelated  with  values  at  other  points. 
This  is  a  sample  of  the  same  noise  used  to  generate  the  auto¬ 
correlation  values  with  little  fluctuation  from  the  expected 
ratio  0(o(.,^)/0(O,  0)  =  .7,  oC  ^0,^/0. 

Figure  13a  is^  the  target  plus  noise  normalized  to  the  highest 
value.  Figure  13b  is  the  same  picture  filtered  by  a  3  x  3 


nct^borhood  and  13e  la  tha  pletura  again  but  flltared  by  a 
8x5  nalghborhOAii. 

Th#  fact  that  tha  targat  loaas  Its  shapa  la  sonawhat  dalatarlous 
to  tha  humans'  ability  to  find  tha  targat  avan  iriian  proeaaaad. 
This  is  baeausa  tha  human  looks  for  struetura  parhaps  without 
groat  ragard  to  Intanslty. 
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APPENDIX  A 


Referring  to  equation  (2)  on  page  19  the  denominator  can  be 
written  as 


c.c.9^. 
1  3  13 


and  the  numerator  as 


where 


or  =  s^  +  ^ ,  and  ^  =  n^. 


Thus  equations  (3)  on  page  23  become 

for  all  values  of  n^  or  more  simply 


(1') 


(2*) 


for  all  n,  where  a  is  independent  of  n.  Notice  that  9^  -  ^ 

ij  ”  ”31 

and  so  using  standard  matrix  notation  with  k  representing  the 
vector  solution  of  the  c^  we  obtain, 

i  -  *  ®  -ST  (3.) 

corresponding  to  equation  (4)  on  page  23. 


Since  equation  (5)  represents  a  critical  point  of  R  then  under 
the  assumptions  listed  there  a  necessary  and  sufficient  con¬ 
dition  for  this  to  represent  an  absolute  maximum  is  for 


R(k-t-h)  -  R(k)  =  F(h)  to  be  non  positive  for  all  vector  dis¬ 
placements,  h.  If  h  is  parallel  to  k  it  is  clear  that  F{t^  =  0 

for  the  value  of  R  is  not  sensitve  to  the  undetermined  scalar  a 
in  k. 

Substituting  k  into  R  we  obtain 

R(k)  =  (a  =  ^*  ][  (4' 

The  quantity  ^  will  be  denoted  by  oC  and  since  the 
positive  definiteness  of  J  implies  that  of  we  have  o(^0 

for  a  non  trivial  target  and  background. 


Next  for  R(k+h)  we  have 


R(k+h)  =  (ao<+h«_^  /  (aV +2ah'jr+h’  J  h) 

(S') 

or  letting  h'^  =  ^(h)  and  h'  f  h  =  ^  (h)  , 

R(k+h)  =  (ao<+7)2  /  (a^^+2a?'+y3  ) . 

(6'  ) 

Thus 

F(h)  =  /  (oCa^+2  r'a+^). 

(7') 

Considering  the  denominator  of  equation  (?•)  as  a  function  g(a) 
of  a,  its  discriminant  is  four  times  the  numerator.  If 
were  positive  for  a  single  value  of  h  then  an  a  could  be  found 
such  that  g(a)>0  so  the  F(h)^0  and  k  is  not  an  absolute  maxi¬ 
mum.  If  -  7^  is  non  negative  for  all  h  then  g(a)>  0  (except 
when  o<  ^  ^  =0  and  then  for  only  one  value  of  a  does  g(a)  =  0) 


A-3 


and  F(h)  is  non  positive  so  that  k  is  an  absolute  maxlmun. 
Arranging  terse 

f fh)  -  (h»j2:)^  = 
fe'  (o<f  -  ^•)  h  (S') 

so  that  the  necessary  and  sufficient  condition  reduces  to  the 
positive  semi-definiteness  of  the  matrix  -  .27^'  as  stated 


on  page  23 
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APPHTOIX  B 


In  the  variational  approach  to  maxiaizing 

we  aasuae  the  existence  of  a  maximizing  function  and 

then  consider  a  family  of  "neighboring"  functions  ♦ 

6  S(Jj  ♦  where  €  is  an  arbitrary  real  value  and  S^Jj 
an  arbitrary  function  which  vanishes  on  the  boundary  of  the 
N-dimensional  space.  The  boundary  in  the  present  ease  can  be 
taken  at  infinity.  If  +e  S(Jj  i*  substituted  for 

f(l)  in  equation  (9*)  the  R  becomes  a  function  of  6  and 

shown  that  a  necessary  condition  for  a 
relative  maximum  of  R  is  that  3  R/Be  =  0  for  all  S{Jj  and 
for  ^  =0.  Thus  we  proceed  with 


(9*) 


*e  S(±)J  cr(X)  dX)2 

(10*) 

+6  S'  ^2p2  ^  V(^)  d  ^ 

to  obtain  from  f 9  R/ ^  ^  =  0  the  following, 

( yV^v)(  =  ( yVtf-)(yVs->/)/ 

(11') 

Letting  pf  cr  =  s  and  J  Cf>^-y/ ^  X  have 

i/y)  ^  =  0. 

(12*) 

The  assumption  that  the  integrand  of  equation  (12*)  is 
continuous  with  S  arbitrary  implies  that 
s(tcr -  s^V)  =  0  for  all 

Thus,  with  non  trivial  signal  and  noise,  so  that  s  ^  0, 
t  /  0  we  have 


(13*) 


(14*) 
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